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Abstract 

A bistochastic matrix B of size N is called unistochastic if there exists a uni- 
tary U such that Bij = \Uij\^ for i,j = l,...,N. The set U3 of all unistochastic 
matrices of order TV = 3 forms a proper subset of the Birkhoff polytope, which 
contains all bistochastic (doubly stochastic) matrices. We compute the volume 
of the set U3 with respect to the flat (Lebesgue) measure and analytically eval- 
uate the mean entropy of an unistochastic matrix of this order. We also analyze 
the Jarlskog invariant J, defined for any unitary matrix of order three, and 
derive its probability distribution for the ensemble of matrices distributed with 
respect to the Haar measure on U{3) and for the ensemble which generates the 
flat measure on the set of unistochastic matrices. For both measures the proba- 
bility of finding | J| smaller than the value observed for the CKM matrix, which 
describes the violation of the CP parity, is shown to be small. Similar statistical 
reasoning may also be applied to the MNS matrix, which plays role in describ- 
ing the neutrino oscillations. Some conjectures are made concerning analogous 
probability measures in the space of unitary matrices in higher dimensions. 
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1 Introduction 



Bistochastic matrices appear in variety of problems in different branches of 
science. A bistochastic matrix (also called doubly stochastic) contains real non- 
negative entries, the sum of which in each column and in each row is equal to 
unity. Thus each column and each row of such a matrix can be interpreted as 
a probability vector. The structure of the set Bn of all bistochastic matrices 
of order N is well understood [1]. It is formed by the convex polytope of all 
permutation matrices of size N and it is often called the Birkhoff polytope [2] . 
Analytical expressions for its volume for small dimensionality [3,4] and for the 
leading terms of the volume in the asymptotic limit [5,6] are available in the 
literature. 

In various physical problems it is assumed that the probabilities, which form 
the entries of a bistochastic matrix, arise as squared modulus of an element of 
an (a priori unknown) unitary matrix. A bistochastic matrix B which can 
be generated from an unitary matrix U by the relation Bij = |J7ij|^ is called 
unistochastic (or orthostochastic). 

For instance, such matrices arc used in high energy physics to characterize 
interactions between elementary particles, which can be divided into N gen- 
erations. Since the Hamiltonians describing two kinds of physical interactions 
(usually called 'strong' and 'weak') do not commute, these two Hcrmitian op- 
erators determine a single unitary matrix of order N, which relates both eigen- 
bases. This is the famous unitary matrix of Cabbibo-Kobayashi-Maskawa [7]. 
Note that the squared moduli of the CKM matrix Vckm form the corresponding 
unistochastic matrix B, the entries of which represent probabilities which are 
accessible experimentally [8-12]. 

According to the Standard Model of elementary particles there exist three 
generations of quarks, thus the case of a direct physical importance are unis- 
tochastic matrices of order N = 3. On the other hand, the case N = 4 could 
also become relevant in case a fourth generation of quarks should be discov- 
ered [13-15]. A similar problem in the neutrino physics is characterized by the 
Maki-Nakagawa-Sakata matrix (MNS matrix) [16], some parameters of which 
are still quite uncertain. A relation between the MNS and CKM matrices is 
studied in [17]. 

In practice, given a bistochastic matrix B G Bn it is important to know, 

whether it belongs to the set Un of unistochastic matrices. If this is the case 
one would like to describe the set of all unitary matrices such that B^j = |J7ij|^ 
for i,j = 1, . . . ,N. These questions are of a particular interest for research in 
foundations of quantum mechanics and investigation of properties of transition 
probabilities [18-20], scattering theory [21], quantum counterparts of Markov 
processes and dynamics on graphs [22,23], and the theory of quantum informa- 
tion processing [24] . 

Any bistochastic matrix of order two is unistochastic, so both sets coincide, 
U2 = B2. The situation differs already for A'' = 3. To show this fact Schur 
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considered the symmetric combination of cycle-three permutation matrices, P 
and P-^ = P^. This matrix 



Bs = liP + P^) = l 



1 1 

1 1 
1 1 



(1) 



is clearly bistochastic but it is easy to see that there is no corresponding unitary 
matrix. Hence U3 C B3 and a similar relation holds for an arbitrary N > 
3. Vaguely speaking, the moduli of a unitary matrix need to fulfill certain 
constraints, more stringent than the obvious fact that the sum of squared moduli 
in each row (each column) is equal to unity. As recently analyzed by P. Di^a [25] , 
this simple observation has important consequences for reconstruction of unitary 
matrices from experimental data. 

A general question, if a given bistochastic matrix B is unistochastic, remains 
open, and only partial results are available [26,27]. Necessary and sufficient con- 
ditions for imistochasticity are known for TV = 3 [9,28,29], while the constraints 
for unistochasticity recently obtained by Di^a [25] for the general case of > 4 
are formulated implicitly and do not provide a constructive solution of the prob- 
lem. 

Geometrical properties of the set of U3 of unistochastic matrices of order 3 
were studied in [27, 28] . This set contains a 4-dimensional unistochastic ball 
centered at the flat (van der Waerden) matrix W, for which all entries are equal 
to 1/3. The set U3 is not convex but it is star shaped. This means that if 
B GU3 then the entire interval BW belongs to the set. The volume of U3 with 
respect to the Euclidean (Lebesgue) measure was numerically estimated by a 
Monte Carlo- type procedure [27]. 

The main aim of this work is to derive an analytical formula for the volume 
of the set U3 of unistochastic matrices of order 3 with respect to the flat, Eu- 
clidean measure. We obtain also a compact expression allowing one to average 
any function of elements of B over the set U3. In particular we derive an explicit 
result for the average generalized entropies Sq which in the special case q = 1 
gives the mean Shannon entropy of the columns of unistochastic matrices. We 
compute also the average Jarlskog invariant J proportional to the area A of the 
unitarity triangle, which characterizes any unistochastic matrix [8,9]. Further- 
more we derive higher moments (J'^) and analyze the probability distribution 
P(J), to get more insight into properties of the CKM matrix, which describes 
violation of the CP symmetry. Such an approach was recently suggested by 
Gibbons et al. [30], who used other probability measures for this purpose. 

All averages are computed with respect to the natural Euclidean measure 
in the set of unistochastic matrices and are compared with the averages with 
respect to the measure induced by the Haar measure on 11(3). This measure 
leading to the unistochastic ensemble [26] , called flag-manifold measure in [30] , 
is not uniform in the set U3. Although approach presented, based on properties 
of intertwining operators associated to the group 53 [31,32], is directly applicable 
to the case N = 3, we conjecture also some properties of the measures in the 
sets of unistochastic matrices in higher dimensions. 
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The paper is organized as follows. In the next two sections we present 
the necessary definitions and review key properties of the Birkhofl: polytope S3 
and its subset U3 containing unistochastic matrices. In section 4 we define a 
family of measures in the set W3 and compute its volume with respect to them. 
Similar study of average entropies is presented is section 4 while the average 
Jarlskog invariant and its distribution are investigated in section 5. The paper 
is concluded with section 6, while some conjectures concerning the measures in 
the set of unitary matrices for > 4 are relegated to the Appendix. 

2 The Birkhoff polytope 

A real square matrix B of order N is called bistochastic (or doubly stochastic) 
if it satisfies the following conditions 

i) Bij > ii) ^ Bij = 1 iii) J2 Bij = 1 . (2) 

i 3 

Such a matrix B is often used to describe discrete dynamics, p' = Bp, in 

the space of probability vectors. Condition i) implies that all elements of the 
transformed vector p' are non-negative. Due to condition ii) its 1-norm "^^Pi, 
is preserved. A matrix satisfying two first conditions is called stochastic and 
it sends the simplex of A'^-point probability vectors into itself. Condition iii) 
implies that additionally the transposed matrix B^ is stochastic, which explains 
the name. 

The uniform probability vector p^ with all components equal, pi = 1/N 
stays clearly invariant with respect to any bistochastic matrix, Bp^ = p^ . Thus 
a bistochastic matrix describes a (weak) contraction of the probability simplex 
towards the uniform distribution p^^ . 

Let Bn denote the set of all bistochastic matrices of order N, called BirkhoS 
polytope. This convex polytope is well known in linear programming. Since it 
arises in the problem of assigning A'' workers to N tasks, given their efiiciency 
ratings for each task, it is sometimes called the assignment polytope. 

The BirkhoS^ polytope is equivalent to the convex hull of all A''! permutation 
matrices of size A'^. Hence a permutation matrix P forms an extremal point 
of Bn- All corners of Bn are equivalent in the sense that a given corner can 
be obtained from another one by an orthogonal transformation. A bistochastic 
matrix belongs to the boundary of the Birkhoff polytope if and only if at least 
one of its entries is equal to zero. 

There exists a unique bistochastic matrix W, with all entries equal, Wij = 
jj. It is also called a matrix of van der Waerden, since it saturates the van 
der Waerden inequality [1] concerning the permanent of bistochastic matrices, 
per(i?) > N\N~^ . It is easy to see that W is located symmetrically at the 
center of the Birkhoff polytope. 

A bistochastic matrix B can be determined by its minor of size (A'^ ^ !)• 
Hence the dimensionality of the Birkhoff polytope Bn equals (A^ — 1)^. For 
instance, the dimension of the set B2 is equal to one, and this set forms indeed an 
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interval between the identity matrix I2 and the 2-element permutation matrix. 
In other words, any bistochastic matrix of order two can be written as 



B2{a) 



a 

1 - a 



1 -a 
a 



where a e [0, 1]. 



(3) 



The length of this interval, equivalent to volume of B2, is equal to unity, if 
we consider it as a subset of . However, we are going to consider this set as an 
element of then the distance between the points (1, 0, 1, 0) and (0, 1, 0, 1) 
is equal to 2, so in these units one has voli(i32) = 2. 

In this paper we are going to work with the case = 3, so the Birkhoff 
polytope is defined as a convex hull of 3! = 6 permutation matrices. 



(4) 
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13 



23 
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1 



(5) 

We divided the permutation matrices into two triples, which belong to two 
totally orthogonal 2-planes. A uniform mixture in any triple produces the flat 
matrix W, 



P + P^ 



1 = W = P12+P13 + P23 . (6) 

Working with the standard Hilbert-Schmidt distance, defined by D^{A,B) := 
Ti-{A — B){A — B)* , we see that both triples form equilateral triangles. To pro- 
duce a sketch of them in 4 dimensions we will use the following parametrization 



B{h) = B{biMMM 



hi 



b2 

62- 



61 
63 



&2 
&4 



(7) 



Both triangles shown in Fig. [T^, cross at their center W . Six permutation 
matrices form 15 edges, out of which all belong to the boundary of B-^ and all 
are extremal. There are six long edges, of length a/6, which form two equilateral 
triangles, and nine short edges of length 2. If one plots all of them, as in Fig. [T]3, 
the sketch of S3 becomes complete, but not very illuminating. Another natural 
possibility is to look at the polytope 'from above', the direction distinguished 
by the vector h = (1/3, 1/3, 1/3, 1/3). The Birkhoff polytope then appears sym- 
metrically as a regular hexagon with two inscribed equilateral triangles forming 
the Star of David - see Fig. [1}:. Note that all the diagonals of the hexagon 
belong to the boundary of S3, and that the distance PP12 of the diagonal is 
shorter than the side PI of the equilateral triangle. Any 2-d face of the polytope 
is formed by an isosceles triangle with two short edges and one long. 
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Figure 1: BirkhofF polytope for A'' = 3 plotted in parametrization ([7]), a) two 
orthogonal equilateral triangles centered at the flat van der Waerden matrix W, 
b) all 15 edges determining the polytope, c) the same polytope as seen 'from 
above'. 

The polytope B3 is defined as the convex hull of 6 corners, so one could think, 
it may be decomposed into two 4-d simplices, each determined by 5 points. This 
would be possible, if we could select 4 corners, which span the base of a simplex 
and then allow two other corners to play the role of an apex for two simplices, 
with the same base. However, this would require that the edge connecting both 
apexes is not extremal or it includes one of the corner from the base of the 
simplex. Neither of these holds for the Birkhoff polytope, so its decomposition 
into two simplices is not possible. 

To find a decomposition of B3 into three simplices take three corners of one 
equilateral triangle, e.g. A(P, P^, 1). Out of the orthogonal triangle select a 
side, say the one formed by the corners P12 and P13. These five corners define 
a 4-d simplex. The same construction performed for two other sides of the 
A(Pi2, -P13, P23) produces two other simplices. It is easy to show any point of 
B3 belongs to one of these simplices and that the 4-d volume of any of their 
intersections is equal to zero. Such a triangulation of the Birkhoff polytope 
allows to find that its volume in according to the Lebesgue measure is equal 
to 9/8. A detailed investigation of the geometry of the Birkhoff polytope is 
provided in [33]. 

3 The set of unistochastic matrices 

A certain class of bistochastic matrices can be generated from unitary matrices. 
Let U denote a unitary matrix. Unitarity condition, UU* = 1, implies that the 
matrix B defined by 

B = f{U), so that B,^ = \Uij\^ (8) 

is bistochastic. Any bistochastic matrix B G Sjv for which there exists uni- 
tary U G U{N) such that B = f{U) is called unistochastic. The set of all 
unistochastic matrices of size A'' will be denoted as Un- 

Note that the multiplication of U by any diagonal unitary matrices Di and 
D2 changes the phases of entries of U, but does not modify the corresponding 
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bistochastic matrix. Hence we define an equivalence relation 



U kU' ^ D1UD2 



(9) 



and observe that B = f{U) = f{U'). 

If the unitary matrix U, appearing in ([8]) is orthogonal, the corresponding 
bistochastic matrix B is called orthostochastic^ This is the case for any bis- 
tochastic matrix of size 2, since writing an orthogonal matrix O = ( f°in^ cos^ ) 
and taking i? = arccosa we see that B = f{0) for any bistochastic matrix of 
order two represented in the form ([3]) Therefore any unistochastic matrix of 
order 2 is also orthostochastic. This is no longer the case for > 3, as it is 
explicitly shown later in this section. 

Interestingly this simple mathematical observation has far reaching conse- 
quences for physics. In the theory of elementary particles one defines a discrete 
space-time symmetry called CP, which stands for charge conjugation and par- 
ity. Such a symmetry requires that a physical process in which all particles are 
replaced by their antiparticles is equivalent to the mirror image of the original 
process. 

If such a symmetry were obeyed the CKM matrix Vckm would be orthogonal, 
(and thus invariant with respect to the complex conjugation) or it would be 
equivalent to an orthogonal matrix with respect to (O. For N = 2 this is the 
case for any unitary matrix from U{2). As the CP symmetry was discovered 
in 1964 to be violated in experiments on decay of neutral mesons K, one could 
predict that the number N of generations of quarks in the theory, equal to the 
size of the CKM matrix, has to be greater than two. 

In her first paper on the CKM matrix [8] Jarlskog observed that for any 
unitary matrix U of size 3 the number 



is invariant with respect to multiplication of the matrix U by diagonal unitary 
matrices and permutations. This quantity, now called the Jarlskog invariant, 
computed for the CKM matrix Vckm can be considered as a measure of the 
violation of the CP symmetry. 

Consider now an arbitrary bistochastic matrix B of size N. To check if 
this matrix is unistochastic we need to know whether there exists a unitary 
matrix U such that f{U) = B according to eq. ([8]). The moduli of the unitary 
matrix are determined by the square roots of the entries of the bistochastic 
matrix, \Uij \ = y^Bij and one needs to find a set of phases (pij which guarantees 
unitarity, where Uij = \Uij \ exp (i(/)y ). 

Choose the two first columns of U, which we denote as \ui) and |it2). Their 
orthogonality relation, (ui|u2) = 0, implies that 



(10) 



N 



In some papers this name is used for unistochastic matrices as well. 
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Introducing the notation Lj = \Uji \ ■ \Uj2\ and 9j — — (j)j2 we may rewrite 
this relation as 

N 

^L,exp(i0,) = O. (11) 

This form has a nice geometric interpretation: given a set of N line segments 
of lengths Li, . . . Ljy we need to find the phases 6j in such a way that the entire 
chain is closed. Obviously it cannot be done unless the longest link is shorter 
or is equal to the sum of all other links. We are free to change the order of 
summation in (jlip and hence to relabel the links in such a way that they are 
ordered non increasingly, Li > L2 > • • ■ > L^. Then the chain links condition 
reads 

Li < L2 + --- + Ln . (12) 

If it is satisfied the chain can be closed and forms a unitarity polygon. 

This relation was imposed by the assumed orthogonality of the first two 
columns of U, but analogous conditions should be fulfilled by all links corre- 
sponding to any pair of columns of U. Similar conditions are due to the orthog- 
onality between any two rows of U. This implies the total number of N{N — 1) 
constraints of the form ([72]) . some of which can be dependent [26,35]. However, 
there is an example of a bistochastic matrix B of order four, which satisfies all 
chain links condition for all pair of rows, but not for all pair of columns [26,36], 
so in practice one has to check rows and columns separately. Furthermore, for 
TV > 4 these conditions for B are only necessary but not sufficient to imply 
unistochasticity. 




Figure 2: Chain link condition for unistochasticity: a) a unistochastic matrix 
with a positive area of the unitarity traingle, > 0, b) limiting case: an 
orthostochastic matrix with = 0, c) a bistochastic matrix B not included 
into for which < 0. 

It is comforting to realize that the situation gets simpler for iV = 3. In this 
case the chain links relation for the first two columns reduces to the triangle 
inequality, 

IL2-L3I < Li < L2 + L3 , (13) 

and the first constraint is required if we relax the assumption that the links 
are ordered decreasingly. Although in general one should check similar relations 
stemming to other pairs of columns and rows of U, in the case = 3 the last 
column by construction has the right moduli and does not impose any further 
restrictions [29]. Thus in this case the relation allowing a chain to close is 
sufficient for unistochasticity [9,28], and explicit formulae for the phases 0y are 
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provided below. If there is a bistochastic matrix B such that in all relations 
(fT3|) equality takes place, the phases 9j are equal to zero or to tt. Thus the 
corresponding matrix U is orthogonal, which means that B is orthostochastic. 
It is easy to show that a matrix B belongs to the boundary of the set U3 if and 
only if B is orthostochastic. The set U3 forms a 4-dimensional subset of B3 of 
a positive measure, while the set of orthostochastic matrices, at the boundary 
of is three dimensional [27,28]. 

For any given B of order three it is straightforward to check whether link 
conditions are fulfilled, so that B is unistochastic. For instance nine short 
edges, (of length 2) of the Birkhoff polytope belong to U3, while the long edges 
(of length ^/6) do not belong to this set. 

Let us take three such edges spanned by P, and 1, which form the equi- 
lateral triangle. At this plane, the set of orthostochastic matrices, for which 
Li = L2 + L3, forms a deltoid - see Fig. [3^. This figure also called 3- 
hypocycloid, may be obtained by sliding a circle of radius 1/3 inside the unit 
circle. Thus the set of unistochastic matrices corresponds to the interior of the 
deltoid and is not convex. This set contains the maximal unistochastic ball of 
radius r — \/2/3 centered at W, which touches the boundary at the deltoid. 

Incidentally, the very same figure is related in a different way with the set of 
unistochastic matrices of order three. The spectra of these matrices are real or 
belong to the deltoid inscribed into the unit disk and stemming from the real 
eigenvalue equal to unity [26]. 




a) b) P12 c) P12 

Figure 3: Non convex set U3 of unistochastic matrices forms a proper subset 
B3. a) Deltoid obtained by the cross-section of along the plane spanned 
by the equilateral triangle A(P, P^, 1), b) a similar cross-section along totally 
orthogonal plane, c) a view 'from above' as in [2}:. 

Consider a unistochastic matrix B parametrized by The length of the 
links read 

Li = VM^, L2 = Vhh, L3 = ^{l-bi-b2){l-h-b4), (14) 

and the triangle inequality p3p provides the direct condition for unistochasticity. 
Let us write down the area A of this unitarity triangle with sides Li,L2, and 
L3, and semiperimeter p — {Li + L2 + ^3)72. Making use of the Heron's formula 

A = Vpip-Li)ip-L2){p-L3) , (15) 
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and substituting (|14p we arrive with a compact expression for the squared area 
It will be convenient to work with this quantity multiplied by sixteen, 
Q{h) := 46162&364 - (5i + ^2 + ^3 + 64 - 1 - ^'164 - b2hf = 16^2. (16) 

Here h = {61, 62, ^^s, ^'4} represents a vector in which determines a bis- 
tochastic matrix in parametrization ([7]). In fact we can form six unitarity trian- 
gles in this way, depending on what pair of columns or rows we wish to choose. 
Although their shapes differ due to unitarity their area A is the same [9] so the 
quantity A does not change under permutation of the unitary matrix U . 

It is easy to see that all chain-links conditions are equivalent to the single 
condition for unistochasticity, 

A^{B) > . (17) 
In other words B (6) G if and only ii b E H. where 

n-.^ {beR^ ■.bi> 0, b2 > 0, foi + ^2 < 1, Q (6) > 0} . (18) 

Also n is the closure of the connected component of {6 : Q (6) > 0} which 
contains (see [31, Sect. 2]). 

We relate these expressions to one of the standard parametrizations of a 
unitary matrix (see Di^a [25, p. 11]). For any U' € U (3) there are diagonal 
matrices Di,D2 £ U {3) such that DiU'D2 — 



U 



C12 S12C13 S12S13 

S12C23 -C12C13C23 - e"^Si3S23 e"'ci3S23 - C12C23S13 
S12S23 e"'c23Sl3 - C12C13S23 -C12S13S23 - e'*Ci3C23 



(19) 



The parameters are the angles 6*12, ^13, ^23, <5 and Cjk ■= cos 9 jk,Sjk ■= 
sm9jk,0 < 9jk < f for 1 < j < fc < 3. The function ([5]) determines thus 
a unistochastic matrix B (b) — f {U) and its entries read 



bl = C12, b2 = S?2C?3' ^3 = S12C23' (20) 
64 = C12C13C23 + S13S23 + 2C12C13C23S13S23 cos (5. 

We explain how these parameters are used to determine the phases (pij for 2 < 
< 3. We consider only the nondegenerate case in which all entries of U are 
nonzero. This implies Cjk > and Sjk > for 1 < j < A: < 3. The equations ([20]) 
determine 6 up to the sign (in — tt < 5 < tt). The fact that U and U := \Uij] . 
produce the same values for 61 , . . . , 64 causes this ambiguity. We will adopt the 
normalization Im[/22 > 0. This forces sin (5 < 0,Imt/32 < 0,ImC/23 < and 
Im[/33 > 0. The cosines of the phases are computed using the entries in ([TO]) . 
The phases are related to the (interior) angles of the unitarity triangles derived 
from columns 1 and 2, and from columns 1 and 3. For the former case, using 



10 



the lengths from equation (fH|) and denoting the angles Oi,02, 63 by the label on 
the opposite side, we find 

^ Ll + Ll-Ll 61&2 + 6364 63) (1-62-^4) 

cos 1/3 = 



COS U2 



2LIL2 2^6152^364 

COS 022 , 

3 - ^2 _ "1"2 -r {-L- Ul - U3) l^J- - U2 - "4; - "3"4 



Lj + Lj- Lj ^ b,b2 + (1 - 61 - 63) (1 - 62 - 64) - 636 
2iii3 2^6162 (1 - 61 - 63) (1 - 62 - 64) 

= - COS 032. 

From the conditions sin 022 > and sin 032 < we obtain 022 =71 — 63 and 
032 = 02 — n (note the interior angles satisfy < 61,62,03 < tt thus < 022 < tt 
and — TT < 032 < 0). By interchanging columns 2 and 3 we find the remaining 
nonzero phases (we use matrix entry notation from equation ^ for more concise 
statements): 

6ii?i3 + 63i?23 — B31B33 . 

cos 023 = —====== , sm023 <O; 

ZV0l03£>i3i323 

biBi3 + B31B33 — 63^23 . , ^ n 

cos 033 = ^ , Sm033 > 0. 

2v6li?l3-B3li333 



For the example where each hi = i the unitarity triangle is equilateral, each 
6i = ^ and the above equations give 022 ~ ^ — 033 and 032 — = 023- 

We return to the consideration of the Jarlskog invariant. A straightforward 
computation yields 



2 Re {UnU22U*2U;^) = 1 - 61 - 62 - &3 - 64 + hb^ + 6263, (21) 
and thus the square of the Jarlskog invariant (fTO|) reads 



[Im (C/ll[/22f/r2C/2*l)] ' = hb2b3bi - [Re (C/iif/22C/r2C/2*i)] ' (22) 

Substituting expression (|2ip into above equation and comparing the outcome 
with (flB]) we see that 

J2 = ig(6) = AA' (23) 

Thus the squared Jarlskog invariant, proportional to the squared area of the 
unitarity triangle, may also be defined as in p6|) for an arbitrary bistochastic 
matrix B G B3. For simplicity we shall write according to the context J = J{U) 
or J = J{B) = J{B{f{Uy), as it should not lead to misunderstanding. 

The squared Jarlskog invariant, J^, is equal to zero if and only if B is 
orthostochastic, so there exists an orthogonal matrix O, such that Bij = Ofj. 

Following Haagerup [37] we shall call two unitary matrices Ui and U2 equiv- 
alent, written Ui ^ U2, if there exist two diagonal unitary matrices Di and D2 
and two permutation matrices Pi and P2 such that 

U2 - DiPi ■ Ui ■ P2D2 . (24) 
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Observe that due to permutation matrices this relation is more general than the 
relation ([9]). 

Since multiplication by phases or permutations do not vary the area of the 
unitarity triangle we see that the squared Jarlskog invariant of two equivalent 
unitaries are equal, if Ui « U2 then J'^{Ui) = J^{U2)- Going back to the set 
of unitary matrices U{3) we see that J{U) = if C/ is orthogonal, or more 
generally, if U is equivalent to an orthogonal matrix, U = DiPi ■ O ■ 

Thus (U) measures to what extend the matrix U can be transformed into 
an orthogonal matrix by means of enphasing and permutations. It is easy to see 
that J is maximal if the unitarity triangle is equilateral, Li — L2 = ^ 1/3 
so that J^ax = 1/108 - see Appendix El This is the case for the flat matrix W 
of van der Waerden, which corresponds to the unitary Fourier matrix 



1 1 1 

1 exp(i • 27r/3) exp(i • 47r/3) 
1 exp(i • 47r/3) exp(i • 27r/3) 



(25) 



which is an example of a complex Hadamard matrix of order three. Such a 
unitary matrix H of size N is distinguished by an extra condition that all its 
complex entries have the same modulus, \Hij\'^ — 1/N for i, j ^ 1, . . . N [38]. 

Any complex Hadamard matrix H of order three is known to be equivalent 
to the Fourier matrix F3 [37], which implies that J^{H) = J^^x- From this 
perspective the set of Hadamard matrices is maximally distant from the set of 
orthogonal matrices. While complex Hadamard matrices correspond to the flat 
bistochastic matrix W , located at the center of the set of unistochastic matrices, 
the boundary of which is formed by the set of orthostochastic matrices. 

On the other hand for any bistochastic matrix which is not unistochastic, 
the quantity — Q/A defined by ^6]) is negative. Since Q — IQA^ one might 
say that in this case the area of the unitarity triangle is imaginary, since the 
three segments Li cannot be closed to form a triangle. Among all bistochastic 
matrices of size three the quantity Q is the smallest for the matrix Bs of Schur 
((T]) for which Q = —j^^ see Appendix VK\ Indeed, looking at Fig. [3^ we see that 
Bs is such a point of the Birkhoff polytope S3, for which the distance to the set 
IA3 of unistochastic matrices, represented by the gray deltoid, is maximal. 



4 The volume of the set of unistochastic matri- 
ces of order three. 

Since unistochastic matrices of order three are used in various branches of theo- 
retical physics, several geometric properties of the set of these matrices were 
studied in [27]. In particular, in that work the volume of this set was estimated 
numerically. In this section we shall improve these findings by deriving an an- 
alytical formula for this volume. To this end we need to introduce probability 
measures into the set of unistochastic matrices. A first natural choice will be 

a) the flat (Lebesgue) measure /i3/2 used before in [27]. 
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The above notation is due to the fact that this measure belongs to a one- 
parameter class of measures /ifc, defined below. In general any probability mea- 
sure in the set U (N) of unitary matrices induces by function ([5]) a measure into 
the set ZYjv- Thus we will distinguish the case 

b) the measure /zi induced by the Haar measure on f/(3). 

This measure leads to the unistochastic ensemble or random unistochastic 
matrices [26]. Since it is related to the unitarily invariant measure on the flag 
manifold U (3) / [U (1)]'^ it was called Bag-manifold measure in [30]. 

Suppose now that dm denotes the normalized Haar measure on f/(3), db 
is Lebesgue measure on R^, and g is a continuous function on [0,1]^ C M^. 
Due to [31, Theorem 2.1] one may relate the integrals over the space of unitary 
matrices U{3) and over the set G R^, see eq. (fTSl) . which determines the set 
of unistochastic matrices, 



-'/'db. 



9(\Uii\\\Ui2\\\U2i\\\U22f)dm^- ( g{biMMM)Q{b) ' 

'(7(3) ^ ^ JQ. 

(26) 

The function g{bi, 62, ^3, 64) = g{b) determines the function g{B) defined on the 
entire set of unistochastic matrices. 

We introduce new coordinates (&i, s, i, r) with 

&2 = s(l-6i), 63-t(l-6i), 



64 = (1 - s) {l-t) + bist + 2rV&is<(l-s)(l-t). (27) 

Note that s = C23, t — 0^3, r = cos S in terms of the matrix U. 

Then Q = 46i (1 - bif s {1 - s) i (1 - t) (l - r^) , and Q corresponds to 

{(6i,s,i,r):(fei,s,i)e[0,l]\-l<r<l}. 

The Jacobian for the change-of-variables is 
5 (&i, 62, 63,^4) 



d{bi,s,t,r) 



2 (1 - biY y/bisil^s)til-t). 



Making use of the expressions derived in [31] we may write an explicit form 
for an integral of a continuous function g with respect to the measure Q{bY^^/^ 
for arbitrary k > 1/2. Hence it is natural to introduce a one-parameter family 
of measures /i^ on lA^ which satisfy 

g{B)dfik= ( g{b)Q{bf-^'-'db. (28) 

Thus the case fc = 3/2 corresponds to the fiat (Lebesgue) measure on U^, 
while for /c = 1 this expression reduces to (|26|) . so ^1 represents the measure 
induced by the Haar measure on t/(3). 
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Integral at the right hand side of can be rewritten as 

g (b) Q {bf-i db - /' /' /' /' g (61, s (1 - 61) , i (1 - 61) , 64) (29) 



n Jo Jo Jo J-i 



xb\-^ (1 - bif^'^ [4s (1 - s) t (1 - t)f'^ (1 - r'^f ^ dr ds dt dbi, 



where 64 is given by (l27j) . Setting g = 1 and using the standard beta integrals 
we determine the normalization constant [31, Prop. 3.2]) 

Suppose n = 1, 2, 3, . . .: then 



'■n+l/2 



(2n-l)(3n-l)!' 



2n (n+i) 



2n+l 



where (x)^ := 0"^]^ (x + i — 1) stands for the Pochhammer symbol. In partic- 
ular, 

= 3T5T7 = 105- 

gives the volume of considered as a subset of M.*. This is multiplied by 9 to 
produce the volume relative to [5] - see Appendix [Al 

Thus the ratio of the 4-dimensional volume of in to the volume of all 
bistochastic matrices is 

This is in agreement with the outcome of earlier numerical calculations [27, eqn. 
(24)] which were based on roug hly 10"^ sample points and yielded 0.7520±0.0005. 

For completeness let us add that the volume of the set with respect to 
the flag-manifold measure fix reads hi = tt/2. 



5 Mean entropy of a = 3 unistochastic matrix. 

The Shannon entropy of an iV-point probability vector p — (pi , . . . pi^ ) is defined 

by 

N 

S{p) - ^Pi^np^ . (33) 

1=1 

This quantity measures to what extent the vector is mixed and varies from 
for any pure vector (1,0, .. .0) to IniV for the maximally mixed vector = 
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(1/7V, . . . 1/N). In an analogous way one defines the entropy of a bistochastic 
matrix 



equal to the average entropy of its rows (or columns). For any permutation 
matrix P this entropy is equal to zero while its maximum value In N is attained 
at the flat matrix W. The mean entropy of a bistochastic matrix was considered 
by Slomczynski [39] and later analyzed in [26,40]. 

To derive an expression for the average entropy of a unistochastic matrix 
with respect to any probability measure on U3 bi-invariant under the symmetric 
group 53 we observe that is equal to the expected value of — 3-Bii InBu. This 
can be computed with respect to the probability measure hj^ Q [b) ^ db on f2. 

Let us denote the mean entropy by {S)qi^, where parameter k labels the 
measure defined in ((28)) . From (|29|) specialized to functions of 61 we obtain 



-3r {3k) r\ 
r(fc)r(2fc)X ' 

7/;(3fc+ 1) - V(fc + 1), (35) 



where the digamma function reads ijj (x) :— F' (x) /F (x) ,x>0. 

The recurrence relation -0(0; + 1) = ip {x) + 1/x is used in the following: 
Suppose n — 1, 2, 3, • • • then 

3n ^ 3n+l ^ 

]=n+l ■> ■j=n+l ■' 



0.1 



In particular the average entropy for the flag-manifold measure /ii gives {S) 
I = 0.833..., This result coincides with the average entropy of random complex 
vectors [41,42] which form a unitary matrix. 

The mean entropy with respect to the flat measure /i3/2 onlA^ reads (<S')jj 3 = 

2(i + i + i) = |f| = 0.90793651 . . .. This quantity was approximated as 0.908 
in [27, eqn. (25)]. 

These data can be compared with the maximal entropy S'max = ln3 w 1.099, 
characteristic of the flat matrix W of van der Waerden. 

For comparison let us now compute the mean entropy with respect to the 
Lebesgue measure averaged over the entire set of bistochastic matrices. 

Straightforward calculation allows us to integrate functions of bi , 62 over 
with respect to the flat measure db := Y^=i ^^i- general, we consider an ar- 
bitrary function /, integrable on the triangle with vertices {(0, 0) , (1, 0) , (0, 1)}. 
The result is 

/ (61, 62) db^ f dbi f ' / (&i, 62) {blb2 + (bl + 62) (1 - 61 - &2)) db2. 



83 "'0 "'0 



(37) 
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The corollary stated in the Appendix specializes this to f {bi) db and allows 
us to find the volume of the BirkhofF polytope. Thus relative to MP one has 
V0I4 (63) = 9/8. 

Formula ([37]) can also be used to compute the average entropy, equal to the 
expected value of (— 36iln6i). The analytic result 



8 / {~3bilnbi) db 



53 
60' 



agrees with the numerical estimation 0.883 obtained earlier in [27]. Note that 
this number is smaller than the mean entropy (S)^ 3 averaged over the set of 
unistochastic matrices, since these bistochastic matrices which do not belong to 
U3 are located close to the boundary of the Birkhoff polytope and are charac- 
terized by small entropy. 

Let us now turn to the generalized entropy defined for any probability vector 
{p^■.l<^<N} 

N 



1=1 

The parameter q ^ I is assumed to be non negative. In the limiting case the 
generalized entropy converges to the standard (Shannon) entropy, limq^i Sq = 
-E»=iP»lnpi. 

Applying definition p8p to a unistochastic matrix B (z similarly to the 
ordinary case we let 



' 1=1 7=1 



We compute the expected value of this expression with respect to h/^^Q (6)'^ ^ db 
on for /c > i 



2 ■ 



It is the same as the expected value of {Bn — Bf^), indeed 



3r (3/0 

n 

3r {k + q)r (3fc) 



q-l\ T{k)T{3k + q) 

When k is an integer number n or a half-integer k = n + ^ this expression is 
a rational function of q and can be expressed with help of the Pochhammer 
symbol {x)^ defined above, 

<^'V,.^J^(l-S^). (39) 
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In particular, taking n = 1 we arrive at handy expressions for the mean gene- 
reahzed entropies averaged over the Haar measure and flat measure respectively, 
which allow for an explicit partial fraction expansion, 



" (g + l)(g + 2) " 9+1 9+2' ^^^^ 

2(V + 34g+105) _ 63 45 35 



(2g + 3)(2g + 5)(2g + 7) 4 (2g + 3) 2 (2q + 5) 4 (2g + 7) ' 

For completeness we provide also an expression for the generalized entropy 
averaged over the set B-j, with respect to the flat measure obtained with help of 
Corollary [2] 

These entropies characterize well the distribution of matrices generated by 
these measures. In particular, a comparison of both expressions in (|42p shows 
that the Haar measure on ?7(3) populates the region close to the boundary of U-;, 
more densely then the vicinity of the flat matrix W around its center. Since the 
squared Jarlskog invariant jg i-,y construction equal to zero at the boundary 
of Z^3, we may expect that its mean value over the flat measure /ii is smaller 
than the average with respect to the Haar measure /i3/2- As shown in the next 
section this is indeed the case. 



6 Distribution of the Jarlskog invariant 

The value of the Jarlskog invariant and its square at a cross-section of the set 
of unistochastic matrices is shown in Fig. [H Recent papers of Gibbons 
et al. [12, 30] analyzed squared Jarlskog invariant [8, 9] averaged over several 
probability measures on the set of unitary matrices. In particular these authors 
computed the expectation value, ( J^), averaged over the 'flag manifold' measure 
induced by the Haar measure on U (3) and analyzed numerically the probability 
distribution P(| J|) with respect to this measure. In this section we proceed one 
step further and derive an analytical formula for this probability distribution. 

We shall start computing the moments of the distribution of the variable 
Q = 4 J2 defined in HH) as a function of a random unistochastic matrix B. This 
task is rather simple, since we can express the moments of Q with respect to 
any measure /ife by the coefficients defined in (|30p. 

_ z^.+n _ 3n (fc - i) r (fc + 1) r (fc + f ) r (fc + nf 

^ " hk + i)r(fc + n+i)r(fc + n+|)r(fc)2 

_ o-3n (fc — ^) (fc)„ 

(fc + n-i)(fc + i)„(fc+|)„' 
where k determines the measure (|28p while n = 0,l,2,.... 
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Figure 4: The absolute value of the Jarlskog invariant |J| a), and its square 
b), at the cross-section of Uz along the plane formed by two permutation 
matrices and the identity. Dark color denotes high values of | J| and J^. The 
maximum is achieved at the van der Waerden matrix W located the center of 
the deltoid. Note that outside the deltoid < and the bistochastic matrix 
is not unistochastic. 



Setting k — \ and n — \ we find that the mean squared Jarlskog invariant, 
averaged over the Haar measure reads (J^)i = (Q/4)i — 1/720 = 1.389 x 10~^ 
in consistence with [30, eqn. (75)]. For comparison note that the average over 
the flat measure yields a larger value, {J'^)z/2 = 3/1144 = 2.622 x 10""^. In 
general the flat measure favors larger values of | J| as it is shown in FiglS] 

Having at our disposal the complete set of the moments of Q we will de- 
termine the exact distribution function P{Q) in terms of hypergeometric and 
related functions. To avoid nuisance factors in the calculations we will consider 
the random variable X := 27Q = 108 so that X takes values in [0, 1]. 

We know that the following relations hold for any A; > 1/2 and n = 0, 1, 2, • • • 

k — — ^ [ a^Q^Q ^ dxQ ~ , ^ , 



VJo " " (fc + n-i) 



In view of the expression (|43p for the moments the above relations allow 
us to find an alternative representation of the desired probability distribution 
PiX). 
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Let Xq , Xi , X2 be independent random variables with the densities 



respectively, each being defined on < 2: < 1. 

Then X has the same moments and the same probability distribution as the 
product XoXiX2- This step, justified in Appendix [Bl enables us to arrive at the 
key result of this section: an explicit expression for the probability distribution 
for X = 108 J^, where J denotes the Jarlskog invariant of a random unistochastic 
matrix generated according to the measure , 

Pk {X) = c, [k - 1) j'^ Q, 1 1; 1 - t-^l^dt. (47) 

It is assumed here that < X < 1 and k > 1/2, the symbol 2F1 stands for the 
hypergeometric function, while the normalization constant reads 

r(fc + i)r(fc + |) 

Cfc := — (48) 

r(fc)' ^ ^ 

In Appendix |B] we made use of this expression to determine an explicit 
expansion for the density function Pk{X). These results allowed us to show the 
distributions -Pi (AT) and P3/2{X) in Fig. [5l Observe that in the case fc = 1 
wc obtain an expression for the distribution of | J|, considered as a function of 
a random unitary matrix U distributed with respect to the Haar measure on 
J7(3) random variable on For small values of |J| this distribution behaves as 
P(| J|) - a\J\^ with A = and a = 87r w 25.133. 

To make a direct connection with the results of [30]. we reproduce here 
the formula for the integrated probability distribution. For any < y < 
the probability of finding a unitary matrix U distributed according to the Haar 
measure on U{3) with | J(i3)| less or equal y reads, 

f 24384 
Pi{\J\ < y} = 87ry+<^ 241n(42/2) (y^ + 4^"* + 96/ + • • • ) - 72y2 + 128y'^ ' 



5 
(49) 

In Appendix [Bl we derive this formula obtained for the measure ni on as 
well as an analogous result for the flat measure fJ-3/2 

P3/2 {\J\<y}^ 420y2 - ^y3 • • • + ^y^ In {4y') + ■■■ (50) 

TT TT 

Both cumulative distribution functions are compared in Fig. [6l 
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Figure 5: Probability distribution P^d J|) of the Jarlskog invariant for random 
unistochastic matrices generated according to the Haar measure /Lti and the flat 
measure /i3/2- 
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Figure 6: Cumulative probability distribution for the functions plotted in Fig. 
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Recent experimental data show that the observed value of the Jarlskog in- 
variant reads [34] 

Jobs = 3.08 [+0.16,-0.18] X 10"^ (51) 

This concrete number can be now compared with the probability distribution 
(gH). A 95% confidence interval for | J| is about [0.00202, 1/6^3 w 0.096], while 
the probability of getting a value of | J| outside this interval is 5%. 

Moreover, we get an explicit estimate for the probability of obtaining at 
random a unitary matrix, such that the absolute values of its Jarlskog invariant 
is smaller then the observed value, 

P{\J\< Jobs} = 7.74 [+0.41, -0.45] x 10"^ (52) 

The statistical hypothesis that the CKM matrix arises from the probability 
experiment of producing a random unitary matrix, with respect to Haar measure 
on U (3), is rejected at the descriptive significance level of 0.08%. 

Another benchmark introduced in Gibbons et al [30], Pflag(|J| < 10~*) « 
2.5085 X 10""^ is consistent with numerical data obtained in Eq.(92) of that 
paper. Note that Gibbons et al [30] constructed several probability models for 
which values as small or smaller than Jobs are more likely. 

For comparison we note that the flat measure /i3/2 in the set of unistochastic 
matrices yields a smaller probability. Using this measure (fc = |) we obtain 
-f3/2{|'^l ^ Jobs} ~ 3.98 X 10~^. Indeed this could be viewed as statistical 
evidence that the transition probabilities in the CKM matrix do arise from a 
unitary matrix. Specifically the so-called likelihood ratio test applied to the two 
probability densities for | J| induced by dfii and ^^d^3/2 (the factor comes 

from P{Q < 0} = 1 - for the flat measure on B3) at | J| = 3.08 x 10"^ 
results in a factor of about 1200. This value is obtained from the formulas for 
/o {x) in Appendix IbI 

Let us express the Jarlskog invariant (fTO|) by the standard parameters (|19p 
of a unitary matrix of order three, 

J{U) = -Ci2C23Ci3Si2S23Sl3Sin(5 (53) 

Observed value of the Jarlskog invariant for the CKM matrix does not imply 
that the CP violating phase 5c km had to be small. In fact 5c km G [62°, 100°] 
so even the value 7r/2 is not ruled out - see e.g. [43]. Hence the small value (I5ip 
is due to the angles Oij in p9|) . which determine the bistochastic matrix. 

Thus we are going to conclude this section with a simple statistical state- 
ment: The CKM matrix should not be considered as a generic unitary matrix 
drawn at random with respect to the Haar measure on U{2>). Furthermore, the 
matrix Bckm of squared entries of Vckm is rather unlikely to be an ordinary 
bistochastic matrix generated at random with respect to the fiat measure in this 
set. 
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7 Concluding Remarks 



In this work we have analyzed the Birkhoff polytope B3 of N = 3 bistochastic 
matrices and its subset U3 of unistochastic matrices. This set contains these 
bistochastic matrices which arise from squared moduh of entries of a unitary 
matrix. We have improved the result of [27] by computing the exact volume of 
U3 with respect to the flat (Lebesgue) measure and found that it takes more 
than three quarters of the volume of the Birkhoff polytope B3. 

We have introduced a one-parameter family of probability measures /ifc into 
the set U3 of unistochastic matrices. Among the measures (|28l) the distinguished 
ones are the uniform (flat) measure /i3/2 and the measure /ii, induced by the 
Haar measure on C/(3). Furthermore, the measure fj,k obtained in the limit k — > 
i coincides with the measure induced by the Haar measure on the orthogonal 
group O (3). 

We derived explicit formulae which allow us to compute expectation values 
of a smooth function of an entry of B with respect to these measures. In this way 
we derived exact expressions for the mean entropy and the generalized entropy 
of a random unistochastic matrix with respect to the measures fik- These values 
can serve as a reference values in studying properties of concrete unitary and 
bistochastic matrices of order three, used in the theory of quantum information. 

In high energy physics and the theory of CP symmetry breaking one works 
with unitary matrices of order 3 and characterizes them by the Jarlskog invari- 
ant (jlOp . Computing all the moments of the squared Jarlskog invariant with 
respect to the measure fik we could represent the probability distribution Pk{J) 
as an integral (|47p of the hypergeometric function 2F1. Expanding this function 
in a series and integrating it term by term we arrived at an explicit represen- 
tation of the desired probability distributions. In particular, working with the 
Haar measure fii we could derive analytical results on the distribution Pi(| J|) 
consistent with the numerical results earlier obtained in Gibbons et al. [30]. 
Our results support then the observation, that the unitary CKM matrix Vckm, 
which describes the violation of the CP symmetry, should not be regarded as a 
generic unitary matrix of order 3. 
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A Extreme values of the parameter Q 

Consider a bistochastic matrix M of order three parametrized by ([7]) and the 
function Q{M) defined in p^ . The aim of this section is to show that for any 
M S S3 this function takes values in [—1/16, 1/27]. Note that if the matrix is 
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unistochastic, M E U3 than Q is non negative and is proportional to the squared 
area of the unitarity triangle. 

It is straightforward to show that Q is invariant under transposition and per- 
mutation of rows or columns (for example replacing (&2, ^4) by (1 — 61 — 62; 1 ^ ^3 
Let us first introduce parameters bi,s,t,x with < bi,s,t < 1 and conditions 
on X to be determined. Motivated by the unistochastic situation let 

62 = (l-6i)s, 

63 = (1-61) 

64 = (1 - s) (1 - i) + hst + X. 

Four more conditions must be satisfied for M (b) to be bistochastic (the in- 
equalities M (&)]^3 > and M {b)^-^^ > arc already satisfied). The simultaneous 
inequalities M (6)23 > and M (6)32 > are equivalent to 

a; < min (ui, U2) , (54) 
ui := s{l-t) + bit{l- s) 
U2 ■.= t{l- s) + bis{l-t) 

and {M (6)22 > 0, M (6)33 > 0} is equivalent to 

-X <mini£i,£2) , (55) 
£1 := (1 - s) {l-t) + bist, 
£2 := st + bi{l-s){l-t). 

With these parameters 

Q (6) = Q' {bi, s, t, x) :=-(!- bif {x^ - Abist (1 - s) (1 - t)) . 

For fixed bi,s,t this is decreasing in x^; thus the maximum value occurs at x = 
and then maximizing over bi,s,t wc obtain Q{b) = ^ when b\ = = ^ = t 
(that is, M {b).j = 5 for 1 < i, j < 3). 

Next we show that the minimum value of Q on B3 is — achieved at the 
Schur matrix (1): by permutations of rows or columns, and then transposition 
of M, if necessary, we may assume < t < s < ^. In this triangle the bounds 
—£2 < X < U2 apply. Rather than considering min {Q' (61, s, t, —£2) , Q' (61, s, t, U2 
in this region we will minimize Q' {bi, s,t, —£2) in the triangle bounded by 
t = 0,t = s,t = 1 — s (vertices (0,0), (5,5) ,(1,0)); this works because = 
Q' (61, s,t,U2 {bi,s,t)) = Q' (61, 1 - s,t, -£2 (61, 1 - s,t)) (writing ^2, ^2 as func- 
tions of {bi,s,t)). Then 

Q' (61, 1 - s, t, -I2 (61, 1 - s, t)) = - (1 - bxf (61 (1 -s){\-t)- stf . 

As a function of (s,i), Q' can not have an interior minimum, so it suffices to 
check the edges of the triangle. On the edge t = we have Q' = — {1 — b{f b\ (1 — 
with minimum value of — ^ at 61 = 5, s = 0. 
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On the edge t = 1 — s wc obtain Q' — — {1 ~ (1 — s)^ with minimum 

value of — at &i = 0, s = i = i (the Schur matrix for x — — -j). 

On the edge s = t,Q' — — {1 — bi)^ (^bi (1 — s)^ — s^^ . This function has 

no interior minimum on the interval < s < |. The endpoints s — and s — ^ 
have already been considered. 

There is a neat formula for the integral of functions of(6i,62) over B3 with 
respect to the flat measure db :— Ylt^i'^^i- The derivation of the formula 
involves adding over the four regions formed in the unit square by the lines 
s = t,s + t = l. 

Proposition 1 Let f be integrable on {(x, y) '■ x,y > 0,x + y < 1}, then 




f{biM)db^ db, f{bi,b2){bib2 + {bi+b2){l-b,-b2))db2. 

JQ JQ 



Observe that the integral kernel is an elementary symmetric function of 
{61,52,1-61-62). 

Corollary 2 Let f be integrable on [0, 1] then 

I f (bi) db=l r f (61) (1 + 56i) (1 - 61)' d6i. 



Thus/g^ ld6-i. 

When considering Bs as a subset of the element of volume db = Y[t=i '^^i 
is multiplied by 9. The map i? : ^ K^, defined in equation [7] is affine onto a 
4-dimensional linear manifold (translate of a subspace) and its Jacobian equals 
9, calculated as the square root of the determinant of the Gram matrix of the 
images of the unit vectors relative to i? ^ ^ For example 



61 



1 




"0 


1 " 




" 1 





-1" 





1 


+ 61 











1 


1 -1 




-1 





1 



the Gram matrix is 



, and its determinant equals 81. 



In this way we may obtain the average entropy and directly derive expression 
(|42|) for the generalized entropy. Furthermore, 8 Jg^ Q (6) db = jig , 8 Jg^ Q (6)^ db 
thus the standard deviation reads ag = 0.01153 . . .. 

Since the set of unistochastic matrices is the subset of B3 for which Q > 0, 
equivalently 

<i?:={46l.s^(l-,s)(l-^)}l/^ 
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we see that 

f pi pi pi pR 

/ f{bi)db= f [bi) [1 - bif dbi ds dt dx 

Jua Jo Jo Jo J-R 

-^l' fibi)by'{^-bifdb,, 

and J^_^ Idb = vr^/lOS in agreement with eq. ([31]) . Observe that R < min {li, hjUi, 
by the (well-known) inequality 2y/xy < x + y for x,y > 0; this is the reason 
that the integral extends over < s, i < 1. 



B Jarlskog invariant as a product of three ran- 
dom variables 

In this appendix we show that the product of three random variables X0X1X2 
introduced in (|44l - [46]) has the same probability distribution as the rescaled 
squared Jarlskog invariant X := 27Q = 108 of random unistochastic matrices 
generated with respect to the measure /ifc defined in (f28|) . We shall start by 
quoting the lemma on probability distribution of a product of two independent 
random variables 

Lemma 3 Suppose Yi,Y2 are random variables on [0,1] with densities gi and 
c.d.f.'s Gi,i = 1,2 (that is, G.,{x) = gi{t)dt = P {Y, < x}, < x < 1). 
Then the density for Y1Y2 is gi (t) g2 (f ) jdt. 

Let us apply this lemma, which can be proved by direct integration, to a 
random variable Y2 = Xq distributed as in (f44|) . 

Corollary 4 If g2 (t) = ^ 5) t''^^ with k > ^ then the density ofYiY2 is 
(^k - x'^-i gi {t) t^'^-^dt, < X < 1. 

Making use of the normalization constant Ck introduced in (j48p we can write 
down explicit form for the density of the product XiX2. 

Proposition 5 The density /12 of XxX2 is Ckx''"^ 2P1 (-1, |; 1; 1 — a^) , < x < 
1. 

Proof. By Lemma [H the density is 

"r(i)r(|) .0 



3/ " V3^ 
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using the substitution t — x + s {1 — x). Now restrict a; to i < a; < 1 then 
< < 1 and we can expand 



{x + s {1 - x)r^^^ ^ x-^/^ 



1 \ -1/3 
.T — 1 ^ 



X 

oo fl\ 



r?,l 



n! 

n=0 



Integrating term-by-term we obtain 

^ r(i)r(|)„^„ n!r(n + i) 



^ n\n\ 
^ ^^{\) ^ " (1) m!(-l)" 

fc— 1 \ ^ V 3 / m /I \ V 3 / n 

^ m\ ^ ' ^n!(l) (m-n)!' 



where the summation variables are changed to n and m = n + j. The inner sum 
is evaluated with the Chu-Vandermonde sum 



^ n!(l) (m-n)! ^ n! (1) 



■1=0 



(1)™ ■ 

This shows that /12 (a;) = c^x^'^ iF\ (i, |; 1; 1 — at least for ^ < a; < 1, 
but both sides are analytic on < a; < 2 so the equality holds for < a; < 1. ■ 

To derive an expression for the density of the triple product X = XqX\\i 
we need to combine lemma [3] with corollary |4l Hence we can write / (a;) = 
(A; - \) (t) t^l^-^dt and /12 it) = c^t''-^ 2F1 (i, |; 1; 1 - t) since 

^fc-i^i/2-fc _ This completes the proof of formula (|T7l) for the distribution 

of the rescaled squared Jarlskog invariant X :— 27Q — 108J^. 

Following Gibbons et al [30] we shall now concentrate on the probabil- 
ity distribution for the absolute value of the Jarlskog invariant, | J|, equal to 
. Let fo{x) denote the density function of X^/^, thus /o (a;) = 
2a;/ (a;^). It is not hard to compute a series for /o (a:) when x is near 1. We 
change the variable of integration t ~ {1 — s)'^ and obtain 

fo {x) = 4cfe (k - i") x^'^-' f 2F1 Q, 1 1; s (2 - s)) ds. 
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Wc expand (s (2 — s)) ^ (for j = 1,2,. . .), collect the coefficients of and 
integrate term-by-term to get 



„ fl 2 \ (1-.t) \3)m^j \3)rn-j ^ 



there is no nice formula for the inner (j-) sum. Thus 

for a; near 1 (that is, not too close to zero). When fc = 1 we have Ci = 

r(|)r(|) = ^and 

Lemma 6 2F1 (l, - t) t'^/'^dt = 3. 

Proof. Indeed 

f (i 1 1; 1 - t-^^^dt = I 1^ c _ V.,, 

V (!)„(§)„ r(n + i)r(i) r(i) 

n!n! r(n+|) r(|)„^„ n! 

r(|)r(i)r(i) . ^_ 

"r(i)r(|)r(i)-ir(i)r(|)-'"% 

We used the Gauss sum 2F1 (a, b; c; 1) = Y['c-a)r(c-'b) for c > a + 6, and the 
equation r(t)r(l-t) = ^. ■ 

Thus / (x) = Cfc (fc - i) a;'=-3/2 (3 - 2F1 (i, |; 1; 1 - t) f-^/^dt). To ana- 
lyze the behavior for x near zero we use the classical formulas for the hypergeo- 
metric series 2F1 (a, b; c; t) at the singular point t = 1. The special case c = a + b 
is more complicated (see [44, p. 257]): 

^„ := 2V' (n + 1) - ( n + i ) - V' ( n + ^ ) . 



By the triplication formula for the ^-function (recall ip (t) = (t) /T {€)), 

^ ('^ " ^^"^^ - V' (n) - 31n3 

= 3V' (3n + 1) - ^ (n + 1) - 3 ln3. 
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because ip {t + 1) = ijj (t) = j for t > 0; the latter formula is valid for n > 0. 
Thus 



A„ = 3 (V^ (71 + 1) - V (Sn + 1)) + 31n3 



3n 



= - J2 - + 31n3. 



j=n+l 



Also 



^ n+1/2 / 

/ (A„-lni)i"-i/2dt^- ^ -Inx + A. 

7o 2r7, + 1 V 



2n+ 1 



Thus 



27r ^ n!n! 2n + 1 

n=0 



27r ^ n!n! 2n + 1 

n=0 



-21na; + A,-, 



2n + l 
2 

2n+ 1 



and we have found the density function of x = ^/X = 6^/3\J\ exhibiting the 
behavior for x near zero. 

In the expression for /o the first few terms inside the braces {} are 



73 
2tt 



In ( — I I 2a; + —x^ + —x'" 
27 M 27 81 



22 3 49 , 

ix x'^ x^ 

81 405 



The cumulative distribution function Fq (x) — P < x} = P {I J| < X/6V3} 



is 



Po (x) = 2cfe ( fc - - ) x^'^-i 



2A; 



3__ V3^ 
- 1 27r ^ 



(!)„(!)„ ^^"^^ 

n!n! (2n + 1) (n + A:) 



27 ^ i' 



3n 



1 



j=n+l 



J 2n + 1 n + k 



The important cases are: 
1. fc = 1, Haar measure on U (3), 



Fo (X) = i^x-^ f- In (^-) (x- + Ix^ + 
° ^ 9 9 V V 27 / V 27 243 
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7290^ 



2. = |, flat measure, 
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3 _ \/3 / 



Fo (x) = — x^ ■(--—( - In I — 1 I -X H x" 

' 27 \2 2tt \ V 27 M 3 135 



16 86 

X J 

9 2025 



Introducing a new variable, y = x/6\/3, we arrive at expressions (j49p and 
l(50|) presented in Sec. [H 
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C Conjectures of measures in higher dimensions 



The method used in Section [4] rehes on one of the authors' [31] construction 
of a hnear operator commuting with the action of S3, mapping homogeneous 
polynomials in three variables to homogeneous polynomials of the same degree, 
and depending on a parameter k (a particular case of the "Dunkl intertwining 
operator"). This operator is realized as an integral over U (3). The case k — 1 
is based on a formula of Harish-Chandra (see Helgason [45, p. 328]) 

exp (Tr {D (x) UD (y) U*)) dm {U) = ,T r \ J" det (w) exp ((xw, y)) 



where the symmetric group on N objects is identified with the set of permutation 
matrices in O (N), for x,y £ the inner product is {x, y) :— ^jVj^ ^ (^) 

is the diagonal matrix with D (x)^^ = Xj, a (x) := ni<i<j<Ar i^i ~ dm is 
Haar measure and cjv is a constant. The relation to unistochastic matrices 
follows from the equation 



N 

TT{D{x)UD{y)U*) ^ ^ x,|C/,,f y,. 



The aforementioned linear operator is known algebraically, that is, with some 
computational effort one can determine the action on any (low-degree!) poly- 
nomial. For = 3 we were able to find a parametrized family of measures to 
implement the operator, roughly 

pix)^ f p{xf{U))diXk[U), 

Jui3) 

where / {U)-j := \Uij\^ as in ^ for any polynomial p. It is known that 

g(\U,,f)dmiU)^{N~l) / g (t) (1 - tf-' dt, 
'U{N) ^ ^ Ja 

for any continuous function g and any matrix entry Uij . As in Section 5 we can 
compute the average (generalized) entropy for the entries \Uij\^ with respect to 
Haar measure {q ^ 1): 

1 r ^ 



q-1 







9 - 1 V (9 + 1)^-1 



i-iy 



*!(^-2-z)!(* + 2)(g + z + l)' 
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the last equation is the partial fraction decomposition. Also lim^^i (Sq) = 
4' {N + 1) — ijj (2) = X]jL2 7 J ™ agreement with the known results for the mean 
entropy of random complex vectors distributed according to the unitarily in- 
variant measure [41,42]. 

We are thus tempted to speculate that there exists a measure on U (N) 
such that 

/,„, ' (I--/) - rj'm-m [ "> 

with | as above. However we must emphasize that there is an impor- 

tant difference between and Un with > 4. There is a single inequality 
characterizing inside (the condition is Q (b) > 0) while {N — 2)^ inequal- 
ities occur in general [25]. Another difference is that the elements of U can 

not necessarily be determined from the values ||CAj|^ : 1 < i,j < A^| (that is, 
up to left and right multiplication by diagonal unitary matrices and permu- 
tation of rows or columns). For instance for the flat matrix W4 of van der 
Waerden, with all entries equal to 1/4 there exists a one parameter family of 
unitary matrices U{a) (rcscalcd complex Hadamard matrices [38]) such that 
\[Uia)U^^[Wi],, = l/A. 

It would be interesting to be able to fit the "flat" measure on Un (inherited 
from Bn) into the fik framework suggested above, but this appears to be a 
sizable research problem in itself. 
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